Environmental data mining and modeling based on machine learning algorithms and geostatistics
Abstract
The paper presents contemporary approaches to the analysis, processing and presentation of spatial environmental data. The main topics concern decision-oriented problems of environmental and pollution spatial data mining and modelling: assessing the value and representativity of data with exploratory data analysis; topological, statistical and fractal measures of monitoring networks; spatial predictions and classifications; probabilistic and risk mapping; and the development and application of conditional stochastic simulation models. The tools used are machine learning algorithms (MLA), namely Multilayer Perceptron, General Regression Neural Networks, Probabilistic Neural Networks, Radial Basis Function Networks, Support Vector Machines and Support Vector Regression, together with recently developed geostatistical predictive and simulation models. The innovative part of the report deals with integrated (hybrid) models, including ML Residuals Kriging/Cokriging predictions and ML Residuals Simulated Annealing/Sequential Gaussian simulations. The objective of the integrated models is twofold. On the one hand, ML algorithms efficiently solve problems of spatial non-stationarity, which are difficult for the geostatistical approach. On the other hand, geostatistical tools are widely and successfully applied to characterise the performance of the ML algorithms by analysing the quality and quantity of the spatially structured information that ML extracts from the data. Moreover, a mixture of data-driven ML and model-based geostatistical approaches is attractive for the decision-making process.
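The hybrid idea described in the abstract (an ML model captures the large-scale, non-stationary trend, and a geostatistical model is then applied to the ML residuals) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a General Regression Neural Network written as Gaussian kernel regression for the ML step and simple kriging with an exponential covariance for the residual step. The function names, the synthetic data, and the parameter values (`sigma`, `sill`, `corr_range`, `nugget`) are all hypothetical; in practice the covariance parameters would be fitted to the residual variogram rather than chosen by hand.

```python
import numpy as np

def grnn_predict(x_train, y_train, x_query, sigma):
    """GRNN prediction: kernel-weighted average with an isotropic Gaussian kernel."""
    d2 = ((x_query[:, None, :] - x_train[None, :, :]) ** 2).sum(axis=2)
    w = np.exp(-d2 / (2.0 * sigma ** 2))
    return (w @ y_train) / w.sum(axis=1)

def exp_cov(h, sill, corr_range):
    """Exponential covariance model (an assumed choice, not taken from the paper)."""
    return sill * np.exp(-h / corr_range)

def simple_krige(x_train, resid, x_query, sill, corr_range, nugget=1e-8):
    """Simple kriging of residuals, treating them as a zero-mean field."""
    h_tt = np.linalg.norm(x_train[:, None, :] - x_train[None, :, :], axis=2)
    h_qt = np.linalg.norm(x_query[:, None, :] - x_train[None, :, :], axis=2)
    # A tiny nugget on the diagonal keeps the linear system well conditioned.
    c_tt = exp_cov(h_tt, sill, corr_range) + nugget * np.eye(len(x_train))
    c_qt = exp_cov(h_qt, sill, corr_range)
    weights = np.linalg.solve(c_tt, c_qt.T)  # shape (n_train, n_query)
    return weights.T @ resid

def hybrid_predict(x_train, y_train, x_query, sigma, sill, corr_range):
    """ML trend plus kriged ML residuals."""
    resid = y_train - grnn_predict(x_train, y_train, x_train, sigma)
    trend = grnn_predict(x_train, y_train, x_query, sigma)
    return trend + simple_krige(x_train, resid, x_query, sill, corr_range)

# Synthetic 2-D example (hypothetical data, for illustration only).
rng = np.random.default_rng(0)
x_train = rng.uniform(0.0, 10.0, size=(80, 2))
y_train = (0.5 * x_train[:, 0] + np.sin(x_train[:, 1])
           + 0.1 * rng.standard_normal(80))
x_query = rng.uniform(0.0, 10.0, size=(5, 2))
y_query = hybrid_predict(x_train, y_train, x_query,
                         sigma=2.0, sill=0.5, corr_range=1.5)
print(y_query)
```

With a near-zero nugget, simple kriging interpolates the residuals, so the hybrid prediction reproduces the observations at the sampled locations while the smooth GRNN trend alone does not. Inspecting the variogram of the residuals is one way to judge how much spatially structured information the ML step left behind.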
Related articles
Machine learning algorithms in air quality modeling
Modern studies in the field of environmental science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. Recent advances in statistical modeling based on machine learning approaches have emerged as a solution to these issues. It is a fact that, input variable type largely affec...
Evaluating machine learning methods and satellite images to estimate combined climatic indices
The reflections recorded on satellite images have been affected by various environmental factors. In these images, some of these factors are combined with other environmental factors that cannot be distinguished. Therefore, it seems wise to model these environmental phenomena in the form of hybrid indicators. In this regard, satellite imagery and machine learning methods can play a unique role ...
Accuracy Improvement of Mood Disorders Prediction using a Combination of Data Mining and Meta-Heuristic Algorithms
Introduction: Since the delay or mistake in the diagnosis of mood disorders due to the similarity of their symptoms hinders effective treatment, this study aimed to accurately diagnose mood disorders including psychosis, autism, personality disorder, bipolar, depression, and schizophrenia, through modeling and analyzing patients' data. Method: Data collected in this applied developmental resear...
Personal Credit Score Prediction using Data Mining Algorithms (Case Study: Bank Customers)
Knowledge and information extraction from data is an age-old concept in scientific studies. In industrial decision-making processes, the application of this concept gives rise to data-mining opportunities. Personal credit scoring is an ever-vital tool for banking systems in order to manage and minimize the inherent risks of the financial sector, thus, the design and improvement of credit scorin...
Application of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogeneous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
Journal title:
- Environmental Modelling and Software
Volume 19
Publication date: 2004